Challenges You Will Face When Parsing PDFs with Python
theseattledataguy.comยท2hยท
Discuss: Hacker News
๐Ÿ“„PDF Archaeology
Preserving the digital legacy of company archives: Last stop, Newhaven.
dpconline.orgยท9h
๐Ÿ’พData Preservation
Top 11 Document Parsing AI Tools for developers in 2025
dev.toยท2dยท
Discuss: DEV
๐Ÿ“„Document Digitization
Point, Don't Point
ilovetypography.comยท2dยท
Discuss: Hacker News
๐Ÿ“œDocument Paleography
Converting a PDF to text locally with Ollama
huijzer.xyzยท2d
๐Ÿ‘๏ธOCR Verification
Digital Forensics Jobs Round-Up, September 15 2025
forensicfocus.comยท2h
๐ŸšจIncident Response
OTW - Bandit Level 4 to Level 5
tbhaxor.comยท11h
๐Ÿ”งKAITAI
Machine-learning tool gives doctors a more detailed 3D picture of fetal health
news.mit.eduยท3h
๐ŸบComputational Archaeology
Semantic Dictionary Encoding
falvotech.comยท2hยท
Discuss: Hacker News
๐ŸŒ€Brotli Dictionary
I asked ChatGPT to imagine my life as a 90's kid with access to AI, and the results were eye-opening
techradar.comยท6h
๐Ÿ’พvintage computing
Bookends 15.2
tidbits.comยท1h
๐Ÿ“„PostScript
UTF-8 Is Beautiful
hackaday.comยท12h
๐Ÿ”ฃUnicode
Lessons from using AI in Discovery
thoughtbot.comยท17h
๐Ÿ•ต๏ธMetadata Mining
Kindred and Co-located Events: PARBICA 21 Demystifying Digital
ipres2025.nzยท15h
๐Ÿ›๏ธNordic Archives
WorldCat Editions and Holdings Release
annas-archive.orgยท1dยท
Discuss: Hacker News
๐Ÿ“šMARC Records
Show HN: Semlib โ€“ Semantic Data Processing
github.comยท3hยท
Discuss: Hacker News
๐ŸŒณIncremental Parsing
New data from OpenAI and Anthropic show how people actually use ChatGPT and Claude
the-decoder.comยท1h
๐Ÿ“ŠFeed Optimization
Learn How to Use Transformers with HuggingFace and SpaCy
towardsdatascience.comยท3h
๐ŸŽฏDependent Parsing
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.comยท13hยท
๐ŸงฎKolmogorov Complexity